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A stronger bound for the strong chromatic index 

Henning Bruhn and Felix Joos 

Abstract 

We prove XslG) < 1.93A(G)^ for graphs of sufficiently large maximum 
degree where Xs (G) is the strong chromatic index of G. This improves an 
old bound of Molloy and Reed. As a by-product, we present a Talagrand- 
type inequality where it is allowed to exclude unlikely bad outcomes that 
would otherwise render the inequality unusable. 

1 Introduction 

Edge colorings are well understood, for strong edge colorings this is much less 
the case. An edge coloring can be viewed as a partition of the edge set of a graph 
G into matchings; the smallest such number of partition classes is the chromatic 
index of G. If we consider the natural stronger notion of a partition into induced 
(or strong) matchings, we arrive at the strong chromatic index Xs(d') of G, the 
minimal number of induced matchings needed. 

The classic result of Vizing, and independently Gupta, constrains the chro¬ 
matic index of a (simple) graph G to a narrow range: it is either equal to the 
trivial lower bound of the maximum degree A(G), or one more than that. The 
strong chromatic index, in contrast, can vary much more. The trivial lower 
bound and a straightforward greedy argument give a range of A(G) < Xs(G') < 
2A(G)^ — 2A(G) -I-1 for all graphs G. Erdos and Nesetfil [7] conjectured a much 
stricter upper bound: 

Strong edge coloring conjecture. Xs(G') < |A(G)^ for all graphs G. 

If true, the conjecture would be optimal, because any blow-up of the 5-cycle 
as in Figure [T] attains equality. For odd maximum degree, Erdos and Nesetfil 
conjectured that Xs(G') < |A(G)^ — ^A(G) -I- 1, which again would be tight. 

In a breakthrough article of 1997, Molloy and Reed [H] demonstrated how 
probabilistic coloring methods could be used to beat the trivial greedy bound: 

X'(G) < 1.998A(G)2 

for graphs G with A(G) sufficiently large. 

We improve this bound: 

Theorem 1. If G is a graph of sufficiently large maximum degree A, then 

x's{G) < 1.93A2. 

A strong edge coloring of a graph G may be viewed as an ordinary vertex 
coloring in the square T^(G) of the linegraph of G. (The square of any graph is 
obtained by adding edges between any vertices of distance of most 2.) Working 
in T^(G) permitted Molloy and Reed to split the strong edge coloring problem 
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Figure 1: Two blow-ups of the 5-cycle 


into two weaker sub-problems. First, they showed that the neighborhood of any 
vertex in T^(G') is somewhat sparse. Second, based on a probabilistic coloring 
method, they proved a coloring result for graphs with sparse neighborhoods, 
that holds for general graphs, not only for squares of linegraphs. 

We follow these same steps but make a marked improvement in each sub¬ 
problem: our sparsity result is asymptotically best-possible; and our coloring 
lemma needs fewer colors. We discuss the differences between our approach and 
that of Molloy and Reed in detail in Sections [5] and [S] 

As a tool for our coloring lemma we develop in Section [7] a version of Tala- 
grand’s inequality that excludes exceptional outcomes. Talagrand’s inequality is 
used to verify that random variables on product spaces are tightly concentrated 
around their expected value. It is particularly suited for random variables that 
only change little when a single coordinate is modified. This will not be the 
case in our application: in some very rare events a single change might result 
in a very large effect. To cope with this, we formulate a version of Talagrand’s 
inequality in which such large effects of tiny probability can be ignored. We 
take some effort to make the application of the inequality as simple as possible 
as we have some hopes that it might be useful elsewhere. 

A weakening of the strong edge coloring conjecture yields a statement on 
strong cliques, the cliques of the square of the linegraph. 

Conjecture 2. Any strong clique of a graph G has size at most uj{L'^{G)) < 
|A2(G). 

Note that the edge set of any blow-up of the 5-cycle is a strong clique, so 
that the conjecture would be tight. Not much is known about this seemingly 
easier conjecture. Chung, Gyarfas, Tuza and Trotter [1] showed that any graph 
G that is 2K2-iree has at most |A^(G) edges. In such a graph, the whole 
edge set forms a strong clique. Faudree, Schelp, Gyarfas and Tuza [8] found an 
upper bound of (2 — e)A^(G) for the size of any strong clique, for some small e. 
Bipartite graphs are easier to handle in this respect: the same authors proved 
that the strong clique can never have size larger than A(G)^. Again, this is 
tight, as balanced complete bipartite graphs attain that bound. 

We prove: 

Theorem 3. If G is a graph with maximum degree A > 400, then its strong 
clique has size at most ijj{Lf(G)) < 1.74A^. 

Coming back to strong edge colorings, let us note that there are a number 
of results for special graph classes. The conjecture was verified for maximum 
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degree 3 by Andersen [5], and independently by Horek, Qing and Trotter m- 
For A(G) = 4, Cranston achieves a bound of x'si^) ^ 22, which is off 
by 2. Mahdian [TB] proved that x's{G) < iog,^G) ^ contain 4-cycles 

by quite involved probabilistic methods. Finally, a number of works concern 
degenerated graphs, the earliest of which is by Faudree, Schelp, Gyarfas and 
Tuza [5] who established the bound Xs(G) < 4A(G) -I- 4 for planar graphs G. 
Kaiser and Kang m consider a generalization of strong edge colorings, where 
alike colored edges have to be even farther apart. 

The use of probabilistic methods to color graphs is explored in depth in the 
book of Molloy and Reed [5D], where also many more references can be found. 
We only mention additionally the article of Alon, Krivelevich and Sudakov [T] on 
coloring graphs with sparse neighborhoods. However, their result only implies 
something nontrivial if the neighborhoods are much sparser than we can expect 
in squares of linegraphs. 

Strong edge colorings seem much more difficult than edge colorings. This 
is because already induced matchings are much harder to handle than ordinary 
matchings. While the size of a largest matching can be quite precisely be de¬ 
termined, it is even hard to obtain good bounds for induced matchings; see for 
instance [TT1I12) . 

All our graphs are simple and finite. We use standard graph-theoretic nota¬ 
tion and concepts that can be found, for instance, in the book of Diestel [5]. 


2 Outline and proof of Theorem [T] 


A strong edge coloring of a graph G is nothing else than an ordinary vertex 
coloring in L^(G), the square of the linegraph of G. (The square of a graph is 
obtained by adding an edge between any two vertices of distance 2.) Molloy 
and Reed [19] use this simple observation to prove their bound on the strong 
chromatic index in two steps. 

First, they show that neighborhoods in L^{G) cannot be too dense. To 
formulate this more precisely denote by for any edge e of G the set of 
edges of distance at most 1 to e, which is equivalent to saying that is the 
neighborhod of e in L^{G). We will often call Nl the strong neighborhood of e. 
Molloy and Reed show that for every edge e 

Nl induces in L^{G) at most (l — ) edges, (1) 

where A is the maximum degree of G. 

In the second step, Molloy and Reed show that any graph with sparse neigh¬ 
borhoods, such as L?{G), can by colored with a probabilistic procedure. 

Following this strategy, we also prove a sparsity result and a coloring lemma. 


Lemma 4. Let G be a graph of maximum degree A > 1, and let e be an edge 
of G. Then the neighborhood Nf of e induces in L‘^{G) a graph of at most 
|A^ 4- 5 A® edges. 

Lemma 5. Let 7,5 G (0,1) be so that 


7 < 


S 


1 — 7 


J3/2 

6 ( 1 - 7 ) 2 ® 


7 

8 ( 1 -^) , 


( 2 ) 
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Then, there is an integer R so that for all r > R it follows xi^) ^ (1 ~ t)''’ for 
every graph G with maximum degree at most r in which, for every vertex v, the 
neighborhoods N{v) induce graphs of at most (1 — < 5 )( 2 ) edges. 

Condition ([5]) might be slightly hard to parse. Therefore, let us remark that 
in a range of ^ S (0, 0.9] 

7 = 0.1827-(5- 0.0778-(5^/2 

satisfies the condition and is not too far away from the best-possible 7 . 

Our main theorem is a direct consequence of the two lemmas. 

Proof of Theorem [7J Let G be a graph with maximum degree A sufficiently 
large. With Lemma |4| we conclude that for every vertex v of L‘^{G) the neigh¬ 
borhood induces a graph of at most (| -I- o(l))(^^ ) edges. Therefore, we can 
apply Lemma[5|with r = 2A^, 5 = 0.24, and 7 = 0.035. □ 

While Molloy and Reed developed this very neat proof technique, our con¬ 
tribution consists in improving its two constituent steps. In particular, in the 
sparsity lemma we improve Molloy and Reed’s ^ in (CD to roughly j. This is 
almost as good as possible: in Section|3|we construct graphs that asymptotically 
reach the upper density bound of Lemma Sj Our coloring lemma also yields a 
7 that is somewhat smaller than the corresponding 7 of Molloy and Reed. We 
discuss the differences between their coloring lemma and ours in more detail in 
Section [5| 

Let us have a look at some concrete numbers. There is a small oversight 
in the proof of Molloy and Reed (a lost 2) that results in the actual bound of 
x'si^) — 1.9987A(G)^ instead of the claimed x'si^) ^ 1.998A(G)^. 

To what improvements lead our lemmas? With our sparsity lemma and the 
coloring lemma of Molloy and Reed it is possible to obtain x's(.G) < 1.99A(G)^. 
Our coloring lemma then leads to the factor 1.93 in our main theorem. Viewing 
all statements as bounds of the form Xs(G) < (2 — e)A(G)^, we improve e by a 
factor of 53. 

Finally, even assuming that the coloring lemma could be vastly improved 
(doubtful), this method can never go beyond 1.73A(G)^. The sparsity bound 
in Lemma [d] which, again, is asymptotically best-possible, does not exclude 
a clique of size 1.73A(G)^ in the neighborhood iV| in Lf{G). That means, 
just considering edge densities will never yield a factor smaller than 1.73 (and 
probably not even close to that number). 


3 Density of the strong neighborhood 


In this section we prove the sparsity lemma, Lemma |T| For any edge e in a 
graph G, we write d^{e) for the degree in Lf{G). Then d^{e) = |V||. 


Lemma 6. Let G 

Then 


be a graph of maximum degree A, and let e be an edge of G. 
d^e) <{2-a- I3)A^ - 2A, 


where aA is the number of triangles in G containing e, and where /3A^ is 
the number of 4,-cycles containing e plus the number of triangles incident with 
exactly one endvertex of e. 
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Figure 2: Some of the parameters used in the proof of Lemma 0] 


Proof. Let e = uv. Then \N{u) U N{v) \ {u, n}! < 2(A — 1) — \N{u) fl N{v)\ = 
(2 —a)A —2. Every edge in has at least one of its endvertices in N{u)UN{v). 
Edges with both endvertices in N{u) U N(v) lie in a common 4-cycle with e or 
form a triangle with either u or v, and thus count towards /3A^. In total we 
obtain 

d=(e) < \N{u) U N{v) \ {it, ?;}| • A - /JA^ < ((2 - a)A - 2)A - /JA^, 
which gives the affirmed bound. □ 

We remark that the inequality given in Lemma [5] is even an equality if G is 
A-regular. We will frequently use the observation that the containment of an 
edge / in fc 4-cycles implies cP{f) < 2A^ — k. 

We will occasionally need to measure how many neighbors a vertex has in 
some subset of the vertex set. So, for a vertex u in a graph G and some set 
X C V{G), we write dx{v) for \Nc{v) fl A|. We will similarly use the notation 
dp{e) for the degree of an edge e in L‘^{G) restricted to some edge set F. 

Proof of Lemma 0] Without loss of generality, we may assume G to be A- 
regular. Indeed, if G is not then it may be embedded into a A-regular graph, in 
which the number of edges induced by will only be larger. (The embedding 
is a standard technique: iteratively make a second copy of G and connect copies 
of vertices of too low degree by an edge.) 

Let e = uv and set X = Ng{u) U Ng{v) \ {u,v}, while Y = Ng{X) \ (X U 
{m,i:}). Then, using the notation of Lemma [51 we set aA = \Ng{u) fl AG(n)| 
to be the number of triangles containing e, and let /3A'^ be the number of edges 
with both endvertices in X. Observe that |A| = (2 — a)A — 2. We furthermore 
note that 0 < a, /3 < 1; for /3 this follows from 2,5A^ < |A|-A. We denote by rrie, 
the number of edges induced by in L‘^{G). Our objective is to upper-bound 
me- _ 

Let us start with the estimation of m^. Writing for E{G) \ we obtain 

2me = = E - %(/)) 

feNi feNi 

To get an upper bound on use Lemma [51 where we omit the 

triangles and only count those 4-cycles that consist entirely of edges between X 
and Y. Let Gj^ ’ be the number of those 4-cycles, that is, the number of those 
4-cycles that consist only of edges with one endvertex in X and the other in Y. 
Observe that each cycle counted by G^ ' reduces the degree of four edges in 
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Thus, J2feN‘ '^^(/) — (S/GAf" 2A^) — 4(7^’^, by Lemma[Hl Using the lemma 
again, this time to upper-bound |iV||, and substituting in our above estimation 
of rrie, we get 

2me < 2A^\Nl\ - 4Cf^ d^(/) 

< 2A^ ((2 - a - /3)A2 - 2A) - ^ d^(f) 

/ 6 WJ 

= 2(2 - a - /3)A4 - 4A3 - 4Cf^ d^(f) 

f&Nl 

In order to obtain a lower bound on ^/gArs we consider paths of the 

form pxyq in G, where x £ X, y £ Y and q ^ X. The first edge px then is in 
Nl, while the last edge yq is outside. So, each such path pxyq contributes 1 to 
J2fem '%r(/)- Since G is A-regular, there are A — 1 such paths for each fixed 
xyq. Counting the number of such xyq we arrive at ^xiy)iA — dx{y))- 

For later use, we give this parameter a name 

7^^ := X! ^x{y){A - dxiy)), 

V<^Y 


and observe that 0 < 7 < Indeed, for any y £Y,we have dx{'y){A—dx{y)) < 
dx{yY/'^ < ^dx{y) and consequently, yA^ < f ' dxiy) < f ■ 2A^, as 

there can be at most 2A^ edges between X and Y. 

Summing up our discussion: 

^ d^i!) > (A - 1)7A3, 

/ 6 Af| 


which leads to 

me < (2 - a - /3 - I) A^ - + (J " 2) (3) 

It remains to estimate G 4 ’ . For this, let us first compute mx,Y, the number 
of edges with one endvertex in X and the other in Y. On the one hand, we have 

doi^) = rnx,Y + 2/3A^ + |X|, 

while A-regularity, on the other hand, gives us Yxex daix) = A ■ |X|. Together 
with |A| = (2 — Q!)A — 2, this implies 


= (2 - a - 2/3)A2 - (4 - a)A-h 2. (4) 


For Xi^X 2 £ X define c(a:i, X 2 ) to be the number of common neighbors of xi 
and X 2 in Y, where we put c(a;i, 0 : 2 ) = 0 if xi = X 2 - Then 




= E 

Xi,X2^X 


cixi,X2, 

2 
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We compute 


\ / , ' - 


xi,x2^X y^y 

Using the definition of 7 , this gives 

1 


y(^Y 


c{xi,x2)=-\^Y^yy'^~^y\~o 


mx,Y 


X\,X2^X 


y^Y 


= - ((A - l)mx,y - 7^^) • 


Using mx,Y < 2 A 2 as well as ©, we get a lower and an upper bound: 

i ((2 - a-2/3 - 7 ) A^ - GA^) < Y c{xi,X 2 ) < {l (5) 


Xi,X2£X 


We come back to the calculation of C' 


X,Y 


X\,X2^X ^ ' X\,X2^X 

>\ Y c{xuX2f-\(l-l)A\ 


x\,X2£X 

where we used We use the Cauchy-Schwarz inequality: 


Y c{xi,X2f> 

Xi,X2€X 


/l^l^ fYx^,X2€x4xi,X2y 

V 2 


> 2 ■ 


> 2 


cf) ; 

{Ex^,X2&X^i^YX2)) 

(1 ((2-a-2^-7)A3-6A2))' 


( 6 ) 


(2 - a) 2 A 2 

where the last inequality is because of |A| = (2 — a)A — 2 and ® . We continue 


Y c{xi,X2y> 

Xi ,X2€X 


1 f{2-a-2l3--ffY_i2 (2-«)A3 


(2 - a )2 


(2 - a )2 


> (2-a^2^^A4_6A3. 


2 ( 2 -a )2 


We substitute this in our estimation (151) of C' 


X,Y 


x,Y ^ 1 ^(2-a-2/3- 7 )" A4 

<-^4 A I 


2 ( 2 -a )2 

1 /^(2 -a-2/3-7)2 A^ 

2 ( 2 -a )2 


1 
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We can finally complete our estimation ([3]) of nif. to: 


TOe < ( 2 - a- P A'^ - 
To see that this gives 


4 (2-a-2/3-7)\4 , ..3 


2(2-ay 


5AT 


nie < + 5A^, 

observe first that we may assume that 0 < a < ^ or we are already done. Setting 
a; = /3 + ^, we may also assume x < \. Let f(a, x) = 2 — a — x — ■ 

Note that = —1 + < 0 for 0 < a: < i. Thus we may assume 

that a = 0. Consequently, it remains to verify that 2 — x — which 

is an elementary task. □ 


We note that a good number of elements used in the proof appear already 
in the article of Molloy and Reed m- triangles through e, edges in X, paths 
xyq and 4-cycles between X and Y. Our contribution consists in parameterising 
these elements and then combining the parameters in the right way to give a 
nearly tight bound. 

Lemma m naturally gives an upper bound on the size of the largest clique in 
(G). Although cliques have a very simple structure we were not able to push 
the bound given in Theorem [3] significantly further. 


Proof of Theoreml^ Let if be a largest strong clique, that is, a largest clique in 
L^(G), and let /cA^ be its size. If e S if is an edge of G, then its neighborhood 
induces by Lemma H] a graph of at most |A'* -f 5A^ edges. Thus 


kA^ - 1 
2 


< -A^ 
- 2 


5A3, 


which implies k < -I- ^ -I- < 1.74 for A > 400. 


□ 


4 The sparsity lemma is best-possible 

In this section we describe a family of graphs with strong neighborhoods that 
almost reach the upper density bound of Lemma |4l To show this, we turn to 
Hadamard codes. For every k > 2, the corresponding Hadamard code consists 
of 2 • 2^ 0,1-strings of length 2^ each with certain properties. For instance, for 
k = 2, the Hadamard code is 

{1111,0000,1100.0011,0110.1001,1010,0101} 

Notably, the code always contains the all-O-string and the all-l-string. We drop 
these and interpret the remaining code words as subsets of some fixed ground 
set of n = 2^^ elements {xi,... ,Xn}. Then we can see the Hadamard code as a 
set 'Hk oi subsets of {xi,... ,Xn} with the following properties: 

(i) T-Lk contains 2{n — 1) member-sets, each of which has cardinality ^ 








U V 



1100 0011 1010 0101 0110 1001 
Figure 3: The graph G for /c = 2 


(ii) every element xi lies in precisely n — 1 member-sets of'Hk 

(iii) if S, S' € Hk are two distinct member-sets of Hk then [S' fl «5"| G {0, j}. 

For a proof and more details, see for example van Lint [23]. 

Based on Hk = {S'!,..., S' 2 n- 2 } we construct a graph G. Make a copy 
x'^ for each Xi, and put X = {xi,... ,Xn,x'i,... ,x'j^}. Let, furthermore, Y = 
{yi,..., 2 / 2 n- 2 } be a set of 2n — 2 vertices, that is disjoint from X. We define 
a graph G on X U F U {u, u}, where u, v are two vertices outside X U F. First 
off, make u complete to Xi,... ,Xn and v complete to x'l,..., x'.^, and make u 
adjacent to v. Now, let us add a bipartite graph on X U F. For this, we make 
each Uj adjacent to every vertex in Sj and to the set S'' of their copies. See 
FigureOfor an illustration. With this, each vertex in X has (n— l)-l-l neighbors, 
by property (jnj) of Hk, and each yj has degree 2^ = n, by property 0. Each 
of u and v has degree n -|-1. Thus the maximum degree is A = n -|-1. We note, 
furthermore, that there are (2n — 2)n = 2A^ — 0(A) edges between X and F. 

We calculate the number of 4-cycles in G — u — v. Observe that for all i < j 
the number c{yi,yj) of common neighbors of yi and yj is at most 2 • ^ (by 
property (Hill)). Thus, the number of 4-cycles in G — u — n is 

Since there are only 2A — 1 edges incident with u or v, deleting all these edges 
will only result in a loss of O(A^) edges in L^{G). In the square of the linegraph 
oi G — u — V, however, only the 4-cycles counted above reduce the degree cP{e) 
away from the maximal possible value 2 A^ (recall the argumentation in the 
proof of Lemma ED. Thus, recalling that there are 2A^ — 0(A) edges between 
X and F, we obtain that the number of edges induced by in L^(G) 
satisfies 

2muv > 2A2 • (2A2 - 0(A)) - 4 -F 0(A3)^ - 0{A^) 

(Note that each 4-cycle reduces the degree of four edges in L‘^{G).) Therefore 

rriuv > ^A^ - O(A^), 

which asymptotically coincides with the bonnd of Lemma Ej 
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5 The coloring procedure 

Molloy and Reed m described a probabilistic method, called naive coloring 
procedure in [20) . to partially color a sparse graph. In the procedure, every 
vertex first receives a color independently and uniformly at random. Then 
follows a conflict resolution step: whenever two adjacent vertices received the 
same color, both become uncolored. The brilliant insight here is that, with 
a non-negligible probability, in every neighborhood a good number of colors 
appear at least twice at the end of this procedure. In this way, the partial 
coloring saves colors in comparison to a (A(G) + l)-coloring. This, then makes 
it possible to complete the partial coloring greedily to a full coloring. 

In the easiest manifestation of the procedure, whenever there is a conflict, 
that is, whenever two adjacent vertices receive the same color, both vertices 
lose their color. Molloy and Reed hint at a possible a improvement that is less 
wasteful: if additionally random weights on vertices are chosen, only the vertex 
of lower weight need to be uncolored. However, the analysis becomes much more 
tedious. We propose a different conflict resolution mechanism that allows for 
a simpler analysis. In addition, we not only count colors that are used exactly 
twice in the neighborhood of a fixed vertex but also take into account multiple 
occurrences. Molloy and Reed mentioned such a modification, too, but did not 
discuss it in detail. 

We now describe the modified Molloy and Reed coloring procedure. For this, 
assume G to be an r-regular graph, which we will color with C = |"(1 — yjr] 
colors, where 7 £ (0,1) is a constant. The following random experiment is 
performed. 

1. Color every vertex uniformly and independently at random from the set 

{1,...,G}. 

2. Choose for every edge uv independently and uniformly at random an ori¬ 
entation duv 

3. If uu is an edge and u and v received the same color, then uncolor u if duv 
points towards u and uncolor v otherwise. 

The key parameter is the number of colors that are saved in the neighborhood 
of an arbitrary vertex u, that is, the number of vertices that are assigned a color 
that is already used by some other vertex in the neighborhood. To estimate 
this quantity, we introduce the random variable Pu that counts the number of 
pairs of non-adjacent vertices in N{u) that have, after the uncoloring step, the 
same color. To control overcounting, we also define T„, the number of triples 
of distinct non-adjacent vertices v, w, x in N(u) that have the same color at the 
end of the coloring procedure. 

Most of the effort in the proof of Theorem [S] will be spent on estimating the 
expected values of Pu and Tu- Later, in Section [ 8 j we will show that the random 
variables are strongly concentrated around their expectation: 

Lemma 7. Let 7 £ (0,1). There is an R so that for every r-regular graph G 
with r > R it holds that 

¥[\Pu-E[Pu]\>V^log^r] and 

F[\Tu-E[Tu]\>V^\og\] <r-^'°siogr 
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when the coloring procedure is performed with C = |"(1 — 7 )r] colors. 

In order to prove the existence of a global partial coloring with certain nice 
properties, we use the Lovasz Local Lemma to deduce this from our local struc¬ 
ture. 


Lovasz Local Lemma. Let p S [0,1), and A be a finite set of events so that 
for every A G A 

(i) P[A] < p; and 

(ii) A is independent of all but at most d of the other events in A. 

If 4:pd < 1, then the probability that none of the events in A occur is strictly 
positive. 


Proof of Lemma O How large R has to be will become clear at the end of the 
proof. Consider some graph G with maximum degree at most r satisfying the 
density condition on the neighborhoods. We may assume that G is r-regular, 
otherwise we embed G, as in Lemma 01 in an r-regular graph while keeping the 
local density condition. 

We will color G with G = |"(1 — 7 )r] colors, which may be slightly more that 
the claimed bound of x(G) < (1 — 7 )r. However, by choosing R and thus r large 
enough we can find a 7 ' that still satisfies @ and for which [(I — 7 ')r] < (1 — 7 )r 
for every r > R. Thus, if necessary, we may replace 7 by 7 ' in what follows. 

Pick some vertex u of G. We start with the estimation of Pu, the number of 
non-adjacent neighbors of u with the same (final) color. To this end, consider 
two non-adjacent neighbors v and w of u. Note first that the probability that 
V and w receive the same color in step 1 is equal to Assuming that this is 
the case, we observe that in order for v and w to keep their color in step 3, all 
the edges between {u, ui} and the neighbors of the same color as u, w have to 
be chosen in step 2 so that they point away from {u, w}. Thus, if v and w have 
k common neighbors, the probability that they both keep their (common) color 
is 




2r-2k 


i-M 

4 G 


where the factor | stems from the fact that, out of four orientations of the two 
edges between v, w and a common neighbor, only one of these allows both v, w 
to keep their color. 

In total, we get 


P[?;, w received the same 


I 

G 



color and both stay colored] 

k -, / 1 \ 2r 


3 I 


4 G 


1 


1 ---^ 1 -^ 


G 


I 


2G 


I - 


3 

4C 


c 


1 

4C2 


k 


Note that, for any G > I, we have 



> 1. 


II 




Thus, if r is large enough, then 


P[u,w received the same color and both stay colored] 

> (l + o(l))Tj-^-^e 1-"' 

(1 - 7)r 

As there are at least 5 ( 2 ) many pairs of non-adjacent vertices in N{u), the 
calculation above implies 

We will need a similar estimation for triples of non-adjacent neighbors of u. 
So, assume v,w,x to be three vertices in N(u) that are pairwise non-adjacent. 
For 1 < * < 3, let ki be the number of vertices with i neighbors in {v,w,x}. 
Since G is r-regular, fci -|- 2/c2 + S/ca = 3r. 

The probability that all three of v,w,x receive the same color is The 
probability that all three retain their color is computed in a similar way as 
above, where it should be noted that there are now eight possibilities for the 
orientations of the three edges between v, w, x and a neighbor common to all of 
them. 

Using the binomial series (l-|-z)“ = ^ = o(l/r) for a > 1, 

we get 


'[u, w, X received the same color and stay colored] 


1 




2C 




k2 




^3 




fei/3 


i-gp+odM 


2fc2/3 


1 - 


8C 


ki 


< (1 + 0(1)) _ ^^2^2 (1 8(1 _ 


\ fci/3+2 /c 2/S+fcs 


= (1 + 0 ( 1 ))- 


-e 8(1-7). 


(1 — 7)2r2 

By using that every graph with (5(0 edges contains at most many 

distinct triangles (see, for instance, Rivin M), we obtain 

1 


iE[r0 < (i + o(i)) 


j3/2^3 


6 (1 — 7)2r2 


g 8(1-7) 


7 

= (l + o(l)) g(l_7)2 e 

By the inclusion-exclusion principle, we save at least — Tu many colors in 
N{u), that is, the number of colored vertices minus the number of colors actually 
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used in N{u) is at least P„ — r„. Now, if Pu — > 7 r, then the number of 

uncolored vertices is smaller than the number of unused colors in N{u). Thus, 
if we can show that, with positive probability, it holds that Pu — Tu > yr, then 
we can color all remaining uncolored vertices greedily, which then concludes the 
proof. 

To show this, let be the event that 



Since both random variables Pu and Tu are highly concentrated (see Lemma [7]), 
it follows for large r that 


P[^«] < 


Note that the event only depends on Au' if there is some vertex z of distance 
at most 2 to both u and u'. Thus, Au is independent of all other Au' except 
for a number of these that is polynomial in r. We deduce, therefore, from the 
Lovasz Local Lemma that there is a coloring of the vertices of G such that no 
Au holds. This, however, implies together with ([T]) that > yr, provided 

that r is large enough. □ 

6 How to possibly save more colors 

The factor of 1.93A^ of Theorem [1] is still very far from the conjectured factor 
of 1.25A^. While it seems doubtful that probabilistic coloring can ever get very 
close to 1.25A^, there is still some hope that with more sophisticated arguments 
the factor can be improved. For us, two observations fuel this hope. 

First, the two steps, the sparsity lemma and the coloring lemma, are com¬ 
pletely dissociated. That is, the coloring lemma only exploits the sparsity of 
strong neighborhoods but uses no structural information whatsoever. Surely, 
not forgetting that the task consists in coloring edges should help! 

Second, while the sparsity lemma, Lemma |4] is asymptotically tight, the 
conjectured extreme example for the strong coloring conjecture, the blow-up of 
the 5-cycle, has much sparser strong neighborhoods. 

To be more concrete, consider the blow-up in which every vertex of the 5- 
cycle is replaced by a stable set of size k, so that the maximum degree becomes 
A = 2k. Then every edge e has as strong neighborhood all of the rest of the 
graph, and consequently, the strong neighborhood induces about ||A^ edges 
in the square of the linegraph, which is much less than the §A'^ of Lemma SI 
In some sense, this is not surprising because already the degree cP{e) of e in 
the square of the linegraph is much smaller than the 2A^ we are working with, 
namely it is only about |A^. In conclusion, the blow-up of the 5-cycle is quite 
different from what we assume in the sparsity lemma. 

What kind of effects could be at work that explain this difference? In 
Lemma S] we assume that the edge e has degree d^{e) close to 2A^ and at 
the same time a very dense neighborhood. While it is possible to have such 
edges, this cannot be the case for all edges. Indeed, for the strong neighbor¬ 
hood to induce many edges, there have to be many 4-cycles in iV|. (This is 
simply because X and Y, the hrst and second neighborhoods of the endvertices 
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of e, cannot be very large so that the bipartite graph between X and Y is very 
dense.) However, every 4-cycle reduces in L‘^{G) the degree of any of its edges 
by 1, and consequently, there should be many edges / in iV| of lower degree 
(f{f) than 2A^. 

Obviously, an edge / of low degree (P{f) is to our advantage, as we can 
always color / at the very end when all high degree edges are already colored. 
But if we defer coloring of low degree edges then any high degree edge e with 
many low degree edges in its strong neighborhood has, morally, low degree as 
well. Unfortunately, we did not manage to exploit these observations in such a 
way that they result in a substantial improvement. 


7 Talagrand’s inequality and exceptional out¬ 
comes 


To finish the proof of Lemma [5j we still need to show that the probability of Pu 
or T„ deviating significantly from their expected values is very small. To prove 
that a random variable on a product probability space is strongly concentrated 
around its expectation is a very common task, and consequently, a number of 
powerful tools have been developed for this, among them McDiarmid’s, Azuma’s 
or Talagrand’s inequality mnsM- All of these tools have in common that they 
require the random variable to be somewhat smooth. 

Consider a family of probability spaces ((H^, and let (r2,E,P) 

be their product. One common smoothness assumption for a random variable 
X : O —>■ R is that each coordinate has effect at most c: whenever any two 
G ^ differ in exactly one coordinate then |X(a;) — X{oj')\ < c. 

For McDiarmid’s or Azuma’s inequality to give strong concentration in this 
situation, the effect c has to be small compared to n. If that is not the case, 
then Talagrand’s inequality might still be useful. We describe a weaker version 
that is easier to apply than the full inequality. 

We say that X has certificates of size s for exceeding value k if for any uj G 
with X{u!) > k, there is a set I of at most s coordinates such that also X {uj') > k 
for any uj' G 0. with ui\i = The following version of Talagrand’s inequality 
appears in Molloy and Reed [501 P- 234]: 

Theorem 8 (Talagrand). Let ((Hi, E^, he probability spaces, and let 

(H, E,P) be their product space. Let A : H —>■ R 6e a non-negative random 
variable with X Q so that each coordinate has effect at most c, and assume X 
to have, for any k, certificates of size at most k£ for exceeding k. Then for any 
0<t< E[A] ; 


P 


IA - E[A]| > t + mc^/ew,[x] 


< 4g sc'-‘n[x] 


Unfortunately, for the random variables and that are of interest for 
us, the effect c is too large, so that Talagrand’s inequality becomes useless. 
Indeed, in an extreme case changing the color of a single vertex v G N(u) to 
new color A might result in all other vertices in N{u) of color A to lose their 
color. This happens when all these vertices have an edge pointing away from 
V. Then, all pairs of color A counted in Pu are lost, and these might be up to 
r^. (Recall that our graph G is r-regular.) Consequently, the effect c has to be 
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at least - however, already an effect of c ~ r would necessitate a deviation 
of t ~ r Ri E[Pu] for the probability to become small. But for Talagrand’s 
inequality to be useful for us, we need a vanishing probability for deviations t 
that are small compared to E[Pu]. 

Changing a single color might have a very large effect but this is a rare 
exception. Normally, only few vertices in N{u) have the same color, so that 
also only few are affected by any color change. That is, very large effects only 
occur with a very tiny probability. It seems unreasonable that exceedingly 
unlikely events should have a serious impact on whether a random variable is 
concentrated or not. 

What we need, therefore, is a version of Talagrand’s inequality that excludes 
a very unlikely set fl* of exceptional outcomes that nevertheless spoils smooth¬ 
ness. Warnke [M] (but see also Kutin [15]) extended McDiarmid’s inequality 
in a similar direction by considering a sort of typical effect. However, Warnke’s 
inequality is still too weak for us. Grable [3] as well presents a concentration 
inequality that excludes exceptional outcomes, which would be suitable for our 
purposes, were it not for the fact that there is a serious error in its proof. McDi- 
armid [T5], too, describes a Talagrand-type inequality that excludes an excep¬ 
tional set. (Its main feature, though, is to allow permutations as coordinates.) 
The inequality, however, does not seem to be of much use to us either. 

We mention that the powerful, but technical, method of Kim and Vu [14] 
can also handle large but unlikely effects. 

To deal with exceptional outcomes, we modify the definition of certificates. 
Given an exceptional set H* C H and s, c > 0, we say that X has upward (s, c)- 
certificates if for every t > 0 and for every lo G H \ fl* there is an index set I 
of size at most s so that X{uj') > X{uj) — t for any ui' G ft \ ft* for which the 
restrictions uj\i and oj'l/ differ in less than t/c coordinates. 

Directly, Talagrand’s inequality does not give concentration around the ex¬ 
pectation but around the median med(X) of X, that is, around 



However, in typical applications the median is very close to the expected value. 
We will deal with this technicality later. 

Lemma 9. Let ((Hi, Sj, Pi))(hj^ be probability spaces, and let (H, E,P) be their 
product space. Let ft* Q ft be a set exceptional events. Let X : ft ^ W be a 
random variable and let t > 0. 

If X has upward {s,c)-certificates then 


P[|X - med(X)| > t] < 4e"i^ -b 4P[H*]. 


(7) 


We prove Lemma [9] with the original, full version of Talagrand’s inequality. 
Recall that the Hamming distance of two points oj,uj' G ft is defined by the 
number of non-common coordinates or equally ^ weighted ver¬ 

sion (together with a normalization), let a G K." be a unit vector with at > 0. 
We define the a-Hamming distance between lo and uj' by 
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For a set A S and a point uj G fl, let 


d{u}, A) = sup < T : ^ OLi>T for all d G A 

which is equivalent to the largest value r such that all points in A have a- 
Hamming distance at least r to w (for a best possible choice of a). 

Theorem 10 (Talagrand [H]). Let ((fli, he probability spaees, and 

let (f2,E,P) be their product space. IfA,Bcrt are two (measurable) sets such 
that d{uj, A) > T for all oj G B, then 

P[A]P[B] < 

Proof of Lemma [H The two-sided estimation ([T]) follows from the two one-sided 
estimations 



P[X < med(X) - t] < 2e~^ + 2P[n*] 

P[X > med(X) +f\< 2e"i^ -h 2P[rj*], 

of which we only show the first; the argumentation for the second is analogous. 
Let 


^ = {w € fl \ n* : X{oj) < med(X) — t} and 
B = {oj G Ll\Ll* : X{u!) > med(X)}. 

Pick an arbitrary uj G B. By assumption, X has upward (s, c)-certificates, which 
means, in particular, that there is an index set I of at most s indices such that 
uj\j and w'|/ differ in at least t/c coordinates for every uj' G A. Consequently, if 
we set a = l/-^/|T[ • 1/, where 1/ is the characteristic vector of /, then uj and 
oj' have a-Hamming distance at least Hence d{uj,A) > 

Using Theorem [TUI we obtain P[H]P[H] < We conclude: 

P[X < med(X) — t] ■ - < P[X < med(X) — t]P[X > med(X)] 

< P[H]P[H] -h P[U*] 


This completes the proof. □ 

Next, we prove that usually the median is close to the expected value: 

Lemma 11. Let ((0^, E^, Pi))(Lj^ be probability spaces, and let (r2,E,P) be their 
product space. Let LI* Q Q. Let NT : H —>■ R &e o random variable, let M = 
max{sup|X|, 1}, and let c>l. If X has upward {s,c)-certificates, then 

|E[X] - med(X)| < 20cy/s + 20M^¥[n*]. 

We note that the proof technique is not new. A similar proof appears, for 
instance, in Molloy and Reed Uni Ch. 20]. 
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Proof. Clearly, 


|E[X] - med(X)| < E[|X - med(X)|]. 

Put K = [2M/cy/s\, and observe that \X — med(X)| < {K + l)c-v/s. By 
splitting the possible values of \X — med(X)| into intervals of length c^/s, we 
can upper-bound 


K 

E[|X - med(X)|] < ^ cy/^{k + 1)P[|X - med(X)| > fccy^}]. 
k^O 

We apply Lemma [5] for each summand with t = kc^/s: 

K 

E[|X-med(X)|] < 4(fc-h -f P[ff*]) 

< 20cv/s-b20M2p[f7*], 

asEfco(* + l)e"'''^^~ 4.1869 <5. □ 

We conclude under the assumptions as in Lemma [TT] by combining Lemma [5] 
andfm 

P[|X - E[X]| > t + 20c^/s + 20M‘^V[n*]] < 4e"i^ + 4P[ff*]. (8) 

Let us come back to the certificates. As defined, they are witnesses for large 
values of X. Sometimes, it is easier to certify smaller values. To capture such 
situations we say that X has downward {s,c)-certificates if for every t > 0, 
and for every w € \ there is an index set I of size at most s so that 

X{uj') < X{uj) -I- t for every w' S \ for which the restrictions a;|/ and uj'\i 
differ in less than t/c coordinates. By replacing X with —X we observe that 
Lemmas 151 and [TTl and thus (|5]), remain valid for downward certificates. 

We simplify (|5|) a bit more. If t > SOc-y/s and P[f2*] < M~‘^ then t > 
t/2 -h (20c^ -b 20M^F[n*]). Thus: 

Theorem 12. Let ((0^, Pi))”^j^ be probability spaces, (n,E,P) be their prod¬ 
uct, and let id* G be a set of exceptional outcomes. Let A : —>■ R fee o ran¬ 

dom variable, let M = max{sup |A|, 1}, and let c > 1. Suppose P[f2*] < 
and X has upward {s,c)-certificates or downward {s,c)-certificates, then for 
t > 50c^/s 


P[|A-E[A]| > t] < -b4P[n*]. 

8 Concentration for coloring procedure 

We finally complete the proof of Lemma [5] by proving that for every vertex u 
the random variables Pu and are tightly concentrated around their expected 
values. Recall that P„ counts the number of vertex pairs in N{u) that at the 
end of the coloring procedure have the same color, and recall that counts the 
number of such vertex triples. 
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Proof of Lemma We show the statement for Pu, the proof for Tu is almost 
the same. 

Recall that the coloring procedure that defines Pu (and Tu) is based on a 
random experiment on a product space Lt = where the product ranges 

over the vertices (that receive a color) and the edges (that receive a direction) of 
the graph. Thus, every event a; G is a vector that is indexed by V (G) U E{G). 

To apply Theorem[T21 we define the set Ll* of exceptional events as the space 
of events that assign some color to more than logr vertices in N{u) (before 
uncoloring). Recall that we work with C = |"(1 — 7 )?'] colors. To estimate 
observe first that the probability that a particular color appears more than log r 
times in N{u) is at most 


E 

i=[log rJ+1 


S (?)‘ 


1 


i=[log rj+l 


(1 — 7)®r’ 


< r ■ 


(1 - 7 ) logr 


log r 


Thus, we get 


F[n*] < ■ 


( _ ^ _ 

V(1 - 7 ) logr 


log r 

< r~i 


for large enough r. 

Setting s = 3r and c = log^ r, let us check that has downward (s,c)- 
certificates. So, let f > 0 be given and consider an event oj ^ fl*. We have to 
define an index set I of size at most s. We start by including all vertices of N{u) 
in I. Next, consider any vertex v G N(u) that becomes uncolored in step 3 of 
the coloring procedure. This is only the case, if there is a neighbor v' of v that 
receives the same color under ui and if, again subject to w, the edge vv' points 
towards v. We add v' and vv' to I for each such vertex v. In total, we have 
|/| < 3r = s as for every vertex v G N(u) we add at most two additional indices 
to I. 

Let now oj' ^ fl* be an event with Pu{iv') > Pu{uj) + t. As ui' ^ Lt*, every 
color may contribute at most (*°| ^ log^ r pairs to Pu ■ 

For every color A for which there are more pairs of vertices colored with A 
contributing to Pu{oj') than to Pu{uj), there is a vertex v in N{u) of color A 
under oj' but that, under w, is either uncolored or colored with a different color 
than A. 

In the latter case, w and lv' differ in the coordinate v G I (as v receives 
different colors under oj and tv'). For the former case, observe that we had 
added a vertex v' and the edge vv' to I to witness v being uncolored under oj. 
Either the color of v' or the direction of vv' must be different in oj', that is, uj 
and uj' differ again in at least one coordinate of I. In both cases, call such a 
coordinate a X-difference. 

Can a coordinate in / be a A-difference and a ^-difference for two different 
colors A and Yes, but only if it is a vertex u' G / in N{u) that satisfies two 
conditions: first, under to it serves as a witness for one of its neighbors v G N{u) 
losing its color A, say, in the conflict resolution step of the coloring procedure; 
and second, by flipping from color A under tv to color /r in tv' the vertex v' 
contributes new pairs of color /r in This then immediately shows that no 
coordinate can be a A-difference for three (or more) colors. 
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As a consequence, w' and w must differ in at least t/ log^ r = t/c coordinates 
in I as Pu{oj') > -P«(w) + 1. (Recall that under uj' no color can contribute more 
than i log^ r pairs to P„.) This proves that Pu has downward (s, c)-certificates. 

For Theorem fT^ we set M = supPu < r^. With t = log^ Theorem fT^ 
implies for large r that 

P[|P„-E[P„]| >yflog3r] <4expf- ^ ) +4r-§'°®'°s- 

V 16 log r ■ 3r/ 

< r~i log log r 

The only difference of the proof for lies in the fact that, outside 17*, every 
color can contribute up to log^ r /6 triples to T^; for P„ this was at most log^ r /2 
pairs. The resulting higher value log^ r for c can easily be compensated for by 
increasing t to log^ r^/r in the application of Theorem [T^l □ 

9 Exceptional outcomes spoil triangle counting 

We close this article by arguing that Theorem [T^] has potential applications 
beyond our coloring lemma. To make this case, we discuss the number of trian¬ 
gles in a random graph. We note that this problem also serves as a motivating 
example for Kim and Vu El, and for Warnke m- 

Consider the random graphs Q{n,p) that are obtained from Kn by deleting 
uniformly and independently at random an edge with probability 1 — p, where 
p = p{n) may be a function in n. The threshold probability for the triangles is 
p = below that threshold there is with high probability no triangle, above it 
there is with high probability at least one. Let us now examine the number T of 
triangles in 0{n,p), or rather the expected number of triangles E[T] = ( 3 )p^ ~ 
\n^p^. 

Is T concentrated around its expected value, whenever np —>■ oo? We con¬ 
sider here a relatively mild notion of concentration, where we allow deviations 
from the expected value of up to t = en^p^ for small but fixed e > 0. For simpler 
notation, set N = 

For p relatively large, that is, for p > for any 7 > 0, McDiarmid’s 

inequality m is strong enough to show concentration. Indeed, changing any 
coordinate, that is, any edge, may result in at most n new triangles (or at 
most n triangles less), so that the effect c of each coordinate is bounded by n. 
Consequently, for McDiarmid’s inequality to show that jT — E[r]| < t for t = 
en^p^ it is necessary that tends to infinity. This is the case if p > 

A version of McDiarmid’s inequality for binary random variables, see m again, 
allows to drop this threshold to p > 

To go below this threshold, Warnke [21] (but also others) observed that, 
while changing a single edge may create up to n — 2 new triangles (or destroy 
that many), this is exceedingly unlikely. Indeed, we only expect a particular 
edge to be in roughly np^ many triangles. Thus, by the standard Chernoff 
bound, it is, for any (5 > 0 , extremely unlikely that an edge is contained in more 
than max{ 2 np^, n"^} triangles. 

Exploiting the fact that, typically, the effect of changing a single edge is much 

,2 

smaller, Warnke could verify concentration as long as pjv.max{ 2 np^ n* ' } to 

infinity, which is the case when p > . 
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To go even below such p, Kim and Vu m developed a powerful method that 
evidently has a very wide scope of application. Usually, great power does not 
come for free, and this is also the case here: Kim and Vu’s inequality is rather 
technical and not easy to use. 

Let us now apply Theorem [12] and show the strong concentration of T also 
for values of p smaller than We exclude all outcomes where at least 

one edge is contained in more than many triangles. As seen above, this is 
an event of very small probability. Moreover, we may use Warnke’s result (or 
a previous application of Theorem (HI) to observe that it is extremely unlikely 
that T > . (We have used here that T is monotone 

in p.) We add all these outcomes to our exceptional set. 

Next, let us check that T has upward (s, c)-certificates, where s = 
and c = . For a non-exceptional event w, we use as index set / the set of all 

edges lying in any triangle. As there are at most triangles, it follows 

that |/| < s. Now, consider some non-exceptional event uj' such that uj and uj' 
differ in less than t'/c coordinates of I. Then, any edge in I that is present 
in (jj but lost in w' may only result in w' having at most less triangles than 
UJ. Moreover, edges outside / obviously cannot result in the loss of triangles. 
Therefore, T{uj') > T{uj) — t', and we see that T has upward (s, c)-certificates. 

With these values of s and c, we deduce from Theorem fT^ that T is strongly 
concentrated if p > 

For even smaller values of p, we may apply Theorem [T^l once again, and 
set s ~ which then will yield concentration for p > 71 “^®/^°+^. Of 

course, this can iterated several times, so that we get strong concentration for 
p > 71“^+^ for every /3 > 0. 

Let us finally point out that we even have fairly tight concentration around 
the expected value: namely, T is very likely within the range E[T] ± 
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